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(57) Abstract: Method to transfer data among different modules of a system connected by a shared bus, particularly to obtain a 
continuous and without inactive period fairly shared use of resources among the modules themselves under traffic-jam conditions, 
which employs a distributed arbitration algorithm, that can be implemented on both hardware and software of the different modules 
of the system. This algorithm ensures a fair and weighed access of the data to the shared bus interconnecting the modules, the weight 
relating to each module/flow being monitored through tags. Said tags are applied to the data flows entering each module, in order 
to monitor the access to the shared bus. 
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METHOD FOR THE FAIR DATA TRANSFER IN A SHARED BUS BY MEANS OF A DISTRIBUTED 
ARBITRATION ALGORITHM 

5 FIELD OF THE INVENTION 

The present invention relates to a method for the continuous and without 
inactive period transfer of data among different entities (which can be both source 
and destination of data), for instance among the modules (computers) of a system 
(like a telecommunication system), which are connected by a shared bus, particularly 

10 in order to obtain a use of resources fairly shared among the modules theirselves 
under traffic-jam conditions. To this purpose, the present invention uses a new 
weighed Round-Robin algorithm, apt to ensure an easy access of data, controlled 
with fairness. Such an algorithm can be implemented on the hardware mechanism 
involved in the arbitration on the shared bus and the suggested solution can be 

15 applied both on the best effort and on the real time traffic, so as to allow fair- 
accesses also under traffic-jam conditions. 

Furthermore, the invention has been implemented to optimise and to make fair 
the use of shared buses for computers in telecommunication systems and the 
resources of systems using data coming from said buses and it is particularly 

20 interesting for multimedia systems. However, it provides very high performances for 
any kind of systems using shared transfer means and can be implemented both in 
hardware and in software. 
BACKGROUND OF THE INVENTION 

One of the objects sought for in systems for data communication is the ability 

25 of transferring information among different entities, reducing the problems due to 
resource share. Namely, each entity should transfer its own data without being 
affected by the presence of other entities. The systems of nowadays should have 
multimedia features; thus data flows with very different features and needs exist. 
Thus, it would be difficult to foresee the impact of certain flows on the latency time 

30 of others in a system with shared resources, especially as per data with real time 
characteristics. 
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The first way to solve the problem was to increase the transmission 
bandwidth of the means. Such a solution proved very useful in systems wherein very 
slow transfer means (e.g. an interconnection bus) embody the actual bottleneck of 
the system. Unfortunately, this is not the case for most of systems, since the 
bottleneck is hidden in the speed of the single entities, particularly in the speed of 
modules which receive data and then send them on other lines, that can be slow. 

Imagine a situation wherein a lot of source entities are transferring data to the 
same destination entity; in this situation, it is apparent that the system can work 
fluently only when the absorption capabilities of the destination are larger than the 
sum of all production capabilities of each source entity. Again, it happens very 
hardly, since normally an entity has the same production and absorption capabilities, 
so that, even if two only entities insist on the same destination, a problem of traffic- 
jam is very likely to occur. 

For instance, three source entities are shown in the following scheme, which, 
insisting on a single destination entity, provide the traffic-jam in the input buffer of 
the latter. 
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The problem may be solved by increasing the capability of the destination 
entity, by introducing registers or buffers only if the traffic-jam phenomena are 
limited within very short times, otherwise also the buffer can saturate. Furthermore, 
under real time traffic, such an increase must not be higher than a certain treshold, in 
order to avoid a too high delay. 

Thus, the saturation of the destination causes traffic-jam and can put into a 
crisis general arbitration algorithms. An arbitration algorithm with fixed priorities is 
not fair and, thus, it is also not effective as for data flows with particular needs. 
Under traffic-jam conditions, only the source entity with the highest priority would 
have the opportunity to drive the destination entity without any intrusion. 
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Mechanisms of balanced arbitration, such as the popular, common Round-Robin, limit 
the problem only partially. Indeed, if the following scheme is taken into account: 
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and reference is made to the saturation condition already set tortn, wnerein source 

entities with different production capabilities are provided, a balanced algorithm 

(Round-Robin) proves not fair in the case of source entities with very different 

capabilities. In particular, such an arbitration algorithm proves not fair for real time 

traffic, which is managed without particular attention, so causing serious latency 

matters. 

One of the best solution presently applied can be a Round-Robin algorithm 
with double priority, wherein the real time traffic can be re-addressed to the chain 
with the highest priority, while the best effort traffic can be attributed to the chain 
with the lowest priority. Unfortunately, this implementation absolutely cannot solve 
the previously mentioned fairness problems for source entities with different 
capabilities. 

Another way to solve the problem is to use TDM (Time Division Multiplexing) 
methods. In this case, a bandwidth is reserved to each data flow, so that different 
bandwidths are available for different needs. By using such a TDM method, the band 
not used by one of the entities is in fact, not usable by the others, thus the method 
itself proves not effective. 

Also other "weighed Round-Robin" algorithms have been developed, for 
instance based on "urgency counters" according to the proposals of the "Computer 
Architecture and VLSI Systems" division of the Institute of Computer Science of 
Crete (see: "Weighed Round Robin Schedular using Per-class Urgency Counters", 
shown on the web of the Institute), but they also show serious limits. The following 
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Table, containing the results of simulations relating to such algorithms, shows in fact 
that a big number of arbitrations have been lost. 
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The example above shows three entities A, B and C with their relevant 
weights, 10, 6 and 4, respectively, acting on the same shared bus. 

The simulation displays ("winner" coloumn) that many arbitrations do not 
elect any winner, since the winner entity should reach a positive value in its urgency 
counter. Furthermore, the following simulation shows that, when an entity 
dynamically reduces its own data transfer speed, the others do not take advantage of 
this availability of bandwidth. 

In fact, a temporary deactivation has been simulated in the following example, 
simply by forcing to zero the urgency counter of the entity A. The table shows* that 
the number of arbitrations won by the entities B and C is not increased, even when 
the entity absolutely does not partecipate to the arbitrations, if the field "decrement 
value" of all of entities is not immediately updated. 
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SUMMARY OF THE INVENTION 

The object of this invention is to provide a new method to transfer data 
among different modules of a system (especially of a telecommunication system) 
which are connected by a shared bus, particularly to obtain under traffic-jam 
conditions, continuously and with no dead time, a use of resources fairly distributed 
among the modules themselves, by employing mechanisms which allow to control 
the whole amount of used data assigned to each module. 

The idea on which this invention is based is to develop a new algorithm for a 
fair arbitration, which can be used both by hardware and software, the concept of 
fairness being the one defined in S. Keshav: "Congestion Control in Computer 
Networks". 

The inventive method is substantially characterised in that it employs a 
distributed arbitration algorithm, that can be implemented on both hardware and 
software of the different modules of the system, which ensures a fair and weighed 
access of the data to the shared bus interconnecting the modules, the weight relating 
to each module/flow being monitored through tags, and in that said tags are applied 
to the data flows entering each module, in order to monitor the access to the shared 
bus, so that, under traffic-jam conditions, a portion of the shared bus is always 
reserved to each module. Said tags are applied by an operator upon the system start- 
up and the tagging can be modified during the operation of the system. 

In substance, according to the method of this invention, a data flow entering 
the system can be tagged with a new information relating to the priority weight 
which is attributed to that flow in the machine. This new priority field can be used by 
all activities - hardware and software - interested to such a flow. 

The following scheme shows what has just been set forth. 
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The field "priority weight" can be used also by the arbitration algorithm of the 
bus system to solve the problems of access to the bus and to the resources of the 
system itself. This kind of algorithm proved really useful to solve the above reported 
conflicts. 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the annexed drawings: 

Fig. 1 is a block diagram of the hardware for fair arbitration according to the 
invention, used on each entity of a multimedia bus; 

Fig. 2 shows the block diagram of the bus arbitrator, with particular reference 
to the transmission part; 

Fig. 3 shows in graphic form the behaviour of the system while each module 
dynamically changes its own requests to the bus; 

Fig. 4 shows how different entities share the bandwidth of a means, by using 
the algorithm according to the invention; and 

Fig. 5 shows how to modify the conditions in Fig. 4 when utilising a CBQ 
algorithm. 

DESCRIPTION OF EMBODIMENTS OF THE INVENTION 

The first application of the present invention relates to the arbitration 
algorithm of a telecommunication system with shared bus. This bus originally was 
divided into two parts, a data part and an arbitration part, the latter transporting the 
information needed to arbitrate the bus for the subsequent cycle during the 
completion of the previous one. 

The following representation illustrates the main timing of a generic 
arbitration, wherein the concept of traffic with different priorities had already been 
introduced. In this embodiment, each destination expresses its own "status of 
receiving queues", so that each requesting entity can send the relevant request only 
if the relative destination is apt to receive data. After this procedure, the arbitration 
logic, which is distributed on each module, univocally determines the winning module 
by using a Round Robin algorithm to solve the concurrent arbitration requests. 
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As above explained, this Round Robin method cannot solve the traffic-jam 
problems in case of source entities with different capabilities. Thus, the Round Robin 
algorithm is improved according to the invention, so as to obtain a fair arbitrationfor 
data flow possessing different features. The following representation shows that the 
above "priority weight", applied with tags to each data flow, is used as an arbitration 
weight. 
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In this way, modules not winning the access to the bus can increase their 
weight for the subsequent arbitration, so that each module can become the one with 
the highest priority after a certain number of cycles, depending on its first "weight 
applied with tags". 

Situations wherein more entities have the same highest weight can be solved 
with the Round Robin method or simply with a method with fixed priority. Behaviour 
simulations of the inventive method evidenced that the method with fixed priority, 
used just to solve the occasional identities in weight, do not affect priority linearity if 
the highest "weight applied with tags" is higher than the number of entities on the 
bus (inverted weights employed: 0 = the highest priority; 255 = the lowest priority). 

The following representation - a detail of the previous one - shows how the 
field "arbitration weight" can be implemented to differentiate two main groups of 



WO 02/28027 



8 



PCT/EP01/11028 



priorities. For instance, real time flows should use tags with the field relative to the 
priority so that real time path always has priority with respect to best effort paths. 
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Simulations of this algorithm, where more entities drive flows towards the 
same destinations showed a direct linearity with respect to the "weight applied with 
tags" and to the number of won arbitrations. Furthermore, when one or more entities 
stop to transmit, the new available bandwidth is linearly shared among the other 
modules, depending on their relative weights. 

Consider, now, Fig. 1 of the annexed drawings, which is a block diagram of 
the hardware for fair arbitration used, according to the invention, on each entity of 
the multimedia bus. 

The interface of the multimedia bus acts as an intelligent peripheral, mapped 
on PCI bus for the main processor; in this case, a dedicated processor is used to 
create and use input/output queues. This processor is able to move data from and to 
the memory of a processor system, by using a DMA controller, which is included in 
the PCI interface controller. A bus controller is used to control input/output queues 
and to arbitrate the bus, by using the mentioned algorithm. 

The multimedia bus operates in the present implementation with signal levels 
GTL + , by using an owner protocol, so that special transceivers are used to interface 
the same. 

All of the input/output buffers are organised on a dedicated, static RAM which 
can be accessed by the bus processor and by the bus arbitrator. More circular buffers 
are implemented in this memory, so as to create different queues for the traffic to be 
transmitted/received, the high/low priority, the accesses to multiple/single destination 
and to every possible destination. The following table scheme shows how to 
physically share the buffer memory. 
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This implementation refers to a twelve-modules system, which results in the 
memory divided into twelve different transmission buffers, in fact, this is important 
to solve the problem known as "head of line blocking"; in this way, when a 
destination to which it is addressed is under saturation conditions, the arbitration 
5 logic can operate with all other possible destinations. Moreover, the dedicated 
multicast buffer, which is implemented in RAM positions, replacing a traditional FIFO, 
allows to divide the multicast data transfer into more cycles when some of the 
destinations to which it is addressed are under engaged conditions. 

To have differentiate buffers, for traffic with respectively high and low priority 
10 reduces of course the minimum latency for real time flows. 

By way of example, each packet of data transfer could be classified as a 
simplified queueing algorithm CBQ (Class Based Queueing), based on managing a 
share of connections and resources with two classes only. The user's packet UDP 
(User Datagram Protocol) could be classified as having high priority, whilst the other 
15 data could be classified as having low- priority. 

This simplified CBQ algorithm should be implemented at a level of driving or 
controlling a hypothetical network, used to connect the system, in order to ensure a 
first level of "service quality". A better mechanism will be implemented as soon as 
sofisticated protocols, such as RSVP with integrated and differentiated services, are 
20 available. 

On each transmission buffer data have the following format: 



Ut 24 |23 16 | 15 S,7 o| 
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The first information of the packet is a control word, used just by the bus 
arbitrator and the information which brings is: 

- destination address: each group of bits addresses to a specific destination, 
for instance 10001010001 will be used to a multicast data transfer towards 

5 destination entities occupying slots 0, 4, 6 2 1 1 ; 

- transfer size: this field is used to provide information about the length of the 
subsequent data packet; in this case an eight-byte granularity has been used; 

- arbitration weight: it is used to define the priority of the arbitration. 

Fig. 2 shows the block diagram of the bus arbitrator; particular attention has 
10 - been devoted to the transmission part therein. 

A hardware finite state machine has been used to determine the status of the 
various buffers implemented in the dedicated SRAM; this hardware is able to realise 
the presence of transmitting and receiving packets by comparing the various queue 
pointers. 

15 When a transmission packet has been found, its control word is read out. This 

information is used to verify the status of destinations to which it is addressed and 

to prepare the transmission counter and the weight register. 

Each entity simultaneously updates the bit with high/low priority and the 

information relating to the weight on a dedicated arbitration signal. Thus, each entity, 
20 since it knows how to give an interpretation about the information coming from all of 

the other entities, identifies the request having the highest priority, then the winner 

of the arbitration. 

Each entity loosing the arbitration decreases its own weight register (denied 
logic) with the weight of the winner; that is basic to ensure system linearity, since 
25 otherwise entities using weight with high priority would be too advantaged. 

Many software algorithms may be used to calculate the weight of a specific 
flow: the most simple way to dynamically define it has been used herein by way of 
example only, by taking into account the number of users connected to each module. 
With such a criterion, if Umx is the number of users connected to a specific 
30 module and Us of users connected to a system, 
Umx= U1 + U2 + Un etc. 
Us = Um1 + Um2 + Umn etc. 
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The complementary value of the weight for each module is given by: 
Wm = (1-(Um/Us)) * max_weight 

For instance, if ten users are connected to module 1 and twenty users are 
connected to module 2: 
5 U1 = 10 
U2 = 20 
Us = 30 

Maxjweight = 255 (8 bits register) 
Wm1 = (1-(10/30)) * 255 =' 170 
10 Wm2 = (1-(20/30)) *255 = 85 

Thus, module 2 has the highest weight. 

The following tables summarise the simulation of a twelve slot system, using 
different weights for the arbitration, the indicated weight value being the 
complementary one, so that the weight 16 is used as the highest one, whilst the 
15 weight 192 is used as the lowest one. The simulation has been carried out without 
any information about real time and best effort kinds, just to show the inventive 
fairness behaviour. 

The calculation of the latency in this example has been based on 64 byte 
packets, moved at a speed of 2.1 Gbit/sec. The table shows the linear ratio between 
20 weight and bus utilisation. 
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The deactivation of the slot 1 has been simulated in the following table. The 
table shows that the accesses not used by slot 1 are redistributed to the other 
entities, while keeping the fairness of the remainder entities. 

30 
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The depiction provided by Fig. 3 graphically shows the system behaviour 
while each module dynamically changes its own requests to the bus. In this example: 
10 1 . All of the twelve active modules. 

2. Deactivation of slot 1 . 

3. Deactivation of slot 3. 

4. Deactivation of slot 5. 

5. Deactivation of slot 7. 
15 6. Deactivation of slot 9. 

7. Re-activation of slot 1. 

The dynamic sharing of the bandwidth, introduced by this algorithm, has a 
direct impact on the latency values of the bus. Of course, higher weights give raise 
to lower latency times, and so on. 
20 For instance, it can be based on 64 bytes packets, moved at a speed of 2,1 

Gbit/sec to calculate latency. This has been done in the following diagram. 
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The use of the distributed arbitration algorithm according to the invention, 
implemented on both hardware and software of the different modules of the system,' 
allows many advantages. 

First of all, it allows the largest use of the capabilities of the available means 
and, by ensuring fair flow priorities, it allows a fair share of resources, especially 
under traffic-jam conditions. 

Fig. 4 in the annexed drawings shows six entities sharing the bandwidth of a 
means, by using the algorithm above defined, in this example the weight of BW1 
being double than the ones of BW2 and BW3, whilst the one of BW4 is double than 
the ones of BW5 and BW6. The hatched bandwidths represent the traffic having 
higher priority (1, 2 and 3), while the others (4, 5 and 6) represent the traffic having 
lower priority. 

The same Fig. 4 shows that, when the traffic having high priority uses all the 
available bandwidth, the other flows (4, 5 and 6} are reduced to zero, whilst, when 
there is a limited traffic having high priority, a redistribution among the means takes 
place also for flows having low priority. Furthermore, it is apparent that the 
relationship between BW1 and BW2/3 is always constant, like the relationship 
between BW3 and BW4/5. 

Conversely, Fig. 5 shows the same circuit when the previously defined CBQ 
algorithm is applied thereto: in such a case, the traffic having high priority cannot 
saturate the bus and a minimum bandwidth, which can be programmed, is reserved 
to flows having low priority. 

It is understood that other applications of the invention are conceivable, which 
can indirectly utilise the described mechanisms. As already mentioned, the arbitration 
behaviour can be controlled by the method of input tagging, so that the load of each 
module can be dynamically balanced simply by applying intelligent algorithms in this 
input step. That is to say, by changing the input data flow for each module, a SW 
algorithm is able to change the work load in each computing entity. 
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CLAIMS 

1. Method to transfer data among different modules of a system connected by 
a shared bus, particularly to obtain a continuous and without inactive period fairly 
shared use of resources among the modules themselves under traffic-jam conditions, 
characterised in that it employs a distributed arbitration algorithm, that can be 
implemented on both hardware and software of the different modules of the system, 
which ensures a fair and weighed access of the data to the shared bus 
interconnecting the modules, the weight relating to each module/flow being 
monitored through tags, and in that said tags are applied to the data flows entering 
each module, in order to monitor the access to the shared bus. 

2. Method as claimed in claim 1., wherein a part of the shared bus is always 
reserved to each module under traffic-jam conditions. 

* 3. Method as claimed in claims 1. and 2., wherein the tags are applied by an 
operator upon the start-up of the system. 

4. Method as claimed in any claim 1 . to 3., wherein the tagging can be 
changed during the operation of the system. 

5. Method as claimed in any claim 1. to 4., wherein the weights relating to 
each flow can be dynamically changed. 
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